
Overview
The Hotel Booking Demand dataset was collected from two hotels in Portugal, a resort hotel and a city hotel. It contains hotel booking information between 1 July 2015 and 31 August 2017. To keep our predictive model fast to train, we use only the data from 2016 for training.
The original published dataset has 119,390 rows and 32 columns. After keeping only the instances from 2016, the resulting dataset has 56,707 rows.
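The year filter can be sketched in plain Python as below. This is a minimal illustration, not the code used for the analysis: the inline sample rows are made up, and the column names `hotel` and `arrival_date_year` are assumptions about how the CSV is laid out.

```python
import csv
import io

# Tiny made-up sample standing in for the real CSV file.
# The column names are assumed, not taken from the write-up.
SAMPLE_CSV = """hotel,arrival_date_year
Resort Hotel,2015
City Hotel,2016
Resort Hotel,2016
City Hotel,2017
"""


def filter_by_year(rows, year, year_column="arrival_date_year"):
    """Keep only the rows whose year column equals `year`."""
    return [row for row in rows if int(row[year_column]) == year]


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
bookings_2016 = filter_by_year(rows, 2016)
print(len(rows), "rows before,", len(bookings_2016), "rows after")
```

With the real file, `io.StringIO(SAMPLE_CSV)` would be replaced by an open file handle; the row counts before and after should then match the figures quoted above.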
We aim to explore the following questions using our exploratory data analysis:
- Where do the guests come from?
- How does price vary over the year?
- Which factors could influence cancellations at the two hotels?
Exploratory Data Analysis
Home Country of Guests
Comment: As the choropleth maps and the donut chart show, the largest share of guests (42.8%) is Portuguese, with British and French guests coming second and third.
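The percentages behind a chart like this come down to counting bookings per country and dividing by the total. The sketch below shows that arithmetic on a made-up list of country codes; the codes and the resulting shares are illustrative only and do not reproduce the 42.8% figure.

```python
from collections import Counter


def country_shares(countries):
    """Return (country, percent of bookings) pairs, largest share first."""
    counts = Counter(countries)
    total = sum(counts.values())
    return [(code, 100 * n / total) for code, n in counts.most_common()]


# Made-up sample: one country code per booking.
sample = ["PRT", "PRT", "GBR", "FRA", "PRT", "GBR", "PRT", "ESP"]
shares = country_shares(sample)

for code, percent in shares:
    print(f"{code}: {percent:.1f}%")
```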
Price Fluctuation
Comment: The ADR (average daily rate) is fairly stable across the year for the city hotel, which is reasonable given its booking demand pattern. For the resort hotel, we see a sharp increase in ADR from May to November, with prices peaking during summer. This finding is also intuitive, since we expect demand for the resort hotel to increase in summer.
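A price curve of this kind is typically built by averaging ADR per hotel and month. The following is a minimal sketch of that grouping, assuming each booking is reduced to a `(hotel, month, adr)` tuple; the sample values are invented for illustration and are not taken from the dataset.

```python
from collections import defaultdict
from statistics import mean


def mean_adr_by_hotel_month(bookings):
    """Average ADR for each (hotel, month) pair."""
    groups = defaultdict(list)
    for hotel, month, adr in bookings:
        groups[(hotel, month)].append(adr)
    return {key: mean(values) for key, values in groups.items()}


# Invented sample bookings: (hotel, arrival month, ADR).
sample = [
    ("City Hotel", "January", 90.0),
    ("City Hotel", "January", 110.0),
    ("City Hotel", "August", 105.0),
    ("Resort Hotel", "January", 50.0),
    ("Resort Hotel", "August", 180.0),
    ("Resort Hotel", "August", 200.0),
]
avg = mean_adr_by_hotel_month(sample)

for (hotel, month), value in avg.items():
    print(f"{hotel:12s} {month:8s} {value:.2f}")
```

Plotting the resulting averages per hotel in calendar order would give one line per hotel, which is the shape of comparison described in the comment above.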
Seasonal Fluctuation
Comment: The city hotel has the most bookings in spring (May-June) and autumn (October). The number of bookings for the resort hotel fluctuates less than that of the city hotel, and its booking demand goes down slightly from June to September.